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SUMMARY 


In  the  course  of  analysis  of  macroinvertebrate  data  obtained  during  ecological  surveys 
conducted  during  the  project,  a scheme  ot  data  analysis  was  developed.  The  following  report  de- 
scribes a computer  program  developed  to  perform  a portion  of  this  data  analysis. 


CONTENTS 


Page 

1.  INTRODUCTION 5 

II.  SPECIES  DIVERSITY  MEASURES 5 

A.  Shannon  Diversity  Index 5 

B.  Simpson  Diversity  Index  6 

C.  McIntosh  Diversity  Index 

III.  DESCRIPTION  OF  DICALC  PROGRAM 7 

IV.  DESCRIPTION  OF  PROGRAM  OPTIONS g 

A.  Sampling  Methods 9 

B.  Data  To  Be  Considered 9 

V.  DESCRIPTION  OF  DATA  CARDS 10 

A.  Title  Card 10 

B.  Program  Option  Cards 10 

C.  Substation  Number  Cards 10 

D.  Station  or  Substation  Name  Cards 10 

E.  Species  Names  and  Number  Cards 12 

F.  Format  Cards 12 

G.  Datacards 13 

VI.  ARRANGEMENT  OF  DICALC  PROGRAM 13 

VII.  SAMPLE  OUTPUT 13 

VIII.  CONCLUSION 16 

LITERATURE  CITED 17 

i 

APPENDIX,  DICALC  PROGRAM  LISTING 19 

DISTRIBUTION  LIST  31 


LIST  OF  FIGURES 


Figure 

1 Title  Card 10 

2 Program  Option  Card 11 


3 


II 


3 

4 

5 

6 

7 

8 


Alternative  Sampling  Method  Option  Card 

Substation  Number  Card 

Station/Substation  Name  Card 

Species  Name/Number  Card 

I'ormat  Card 

Sample  Output  From  DICALC  Program  . 


1 


* 12 
. 12 
. 13 
. 14 


4 


A COMPUTER  PROGRAM  FOR  THE  ANALYSIS 
OF  MACROINVERTEBRATE  DATA  FROM  WATER  QUALITY  SURVEYS 


I.  INTRODUCTION. 

Tlie  structure  and  organization  of  biological  communities  are  of  great  interest  to  scien- 
tists involved  in  the  study  of  these  communities.  Community  analysis  has  long  been  a part  of  the 
field  of  pollution  biology  because  one  of  the  basic  assumptions  of  community  analysis  is  that  pol- 
luted communities  have  different  species  occurrences  and  abundances  than  nonpolluted  communities. 
Species  diversity  and  density  measurements  are  routinely  made  by  pollution  biologists  during  pol- 
lution surveys.  The  following  is  a description  of  a computer  program  that  calculates  some  of  these 
parameters. 

II.  SPECIES  DIVERSITY  MEASURES. 

A.  Shannon  Diversity  Index. 

The  Shannon  diversity  index  is  derived  from  information  theory'"^  as 


= ->  — logj  — 

n ^ n 


where  n is  the  total  number  of  individuals  in  k species,  and  n.  is  the  number  of  individuals  in  the  /th 
species.  The  relative  maximum  of  d is  assumed  when  the  «.(/=  1, . . .,  A) are  uniformly  distributed 
within  the  community,  hence 


The  relative  minimum  is  calculated  according  to 


d . = log, 
min  ^2 


logj  (n  - A + 1 ) 


The  relative  efficiency,  synonymous  with  the  normed  version  ofd,or  evenness,'*  is  the 
measure  of  the  ratio  of  the  sample  distribution  of  n.  to  the  theoretical  uniform  distribution  of  n.  in 


En.  n. 

— log,  — 
n n 


log,  A 


The  redundancy  is  the  complement  of  the  efficiency: 
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B.  Simpson  Diveisity  Index. 


The  Simpson  diversity  index  is  defined  as^ 


_ I ^ ' ) 
2-u  n (/!  - 1) 


where 


equations: 


The  relative  maximum  and  minimum  are  calculated  according  to  the  following 


Mk-\) 


k(n-\) 


The  normed  version  of  equation  6 is 


S = (n 

«(1  - 1/A) 


and  the  redundancy  is  again  the  complement  of  the  normed  version  of  the  diversity  index, 


S=l-S 


C.  McIntosh  Diversity  Index. 


The  McIntosh  diversity  index  assumes  samples  are  taken  from  an  infinite  population, 
where  each  sample  contains  n individuals  in  k different  species  present,  n.  individuals  in  the  /th 
species,  and 


The  Huclidean  measure  is 


where  p.  is  the  proportion  of  individuals  that  belong  to  the  /th  species, 


k 


E 

i = I 


Pi 


= 1 


(10) 


and  where  \jk  < A < 1 . 


The  McIntosh  measure  of  species  diversity  is  then  defined  to  be 

(/  = 1 - A 


(ID 


As  each  p.  -*■  1/A:,  U attains  its  relative  maximum, 


U = 1 - — 

max  ^ 


(12) 


The  relative  minimum  is  attained  according  to  the  least  uniformly  distributed  combination  of  the 
/;.(/=  1,2,  . . .,  k)  in  n: 

k - 1 +s/k  - 1 


(13) 


The  ratio  of  the  calculated  species  diversity  U to  its  relative  maximum  is  the 
normed  version  of  U: 


U = 


1 


1-1/^ 


- 


VI /2 


, < = I 


(14) 


The  redundancy  (the  measure  of  the  deviation  of  sample  distribution  with  respect  to 
the  true  uniform  distribution)  is  again  the  complement  of  the  normed  version: 


U=\-U 


(15) 


111.  DESCRIPTION  OF  DICALC  PROGRAM. 

The  FORTRAN  V program  DICALC  (see  appendix)  was  developed  for  analysis  of  bio- 
logical data  and  comparison  of  population  and  ecological  studies.’  DICALC  originally  included 
complete  calculations  of  species  occurrence,  relative  abundance  at  a sample  station,  and  measures  of 


species  diversity  for  analysis  of  the  sample  distribution.  In  addition  to  these  listings,  the  new  version 
of  DICALC  includes  the  computations  for  the  approximate  density  of  the  species  at  each  station  and 
for  the  relative  and  absolute  abundance  of  each  of  the  species.  DIC'ALC  also  calculates  the  grand 
total  for  all  of  the  stations,  determines  the  overall  species  diversity,  the  relative  and  absolute  abun- 
dances, and  the  approximate  density  of  each  species  in  the  sampling  area. 

DIC'ALC  can  accommodate  a maximum  of  150  samples  and  60  different  species.  It 
greater  numbers  of  either  are  necessary,  the  program  may  be  easily  modified  by  use  of  the 
parameters  NSIZF.  and  MSPFX',  the  maximum  number  of  samples  and  the  maximum  number  of 
species,  respectively.  By  changing  the  value  of  either  or  both  parameters,  the  amount  ot  storage  is 
changed  without  further  modifications  to  the  program. 

The  following  is  a list  of  FORTRAN  variables  used  in  this  version  of  DICALC  listed  in 
the  order  entered  for  input  and  in  tlie  order  printed  for  output: 


Variable  name 

Description 

Type 

hipitt 

NS 

Number  of  different  species  present 

15 

NSS 

Number  of  stations 

15 

LSStJ) 

Number  of  substations  for  each  station 

1615 

TITLl 

Site  location 

12A6 

PROC, 

Data  to  be  considered 

A6 

MFTHOl) 

Type  of  sampling  method 

A6 

ARFA 

Numerical  surface  area  of  sampler 

FI  0.6 

S1.A7 

Station  names 

5A6 

SPFC 

Species  names 

5A6 

JM 

Species  numbers 

12 

FMT 

Format  by  which  to  read  data 

13A6.  A2 

Output 

X(I.J) 

Data  matrix 

Sec  FMT 

NPA 

Number  of  individuals  per  unit  area  for  each  station 

11 0.2 

TNPA 

Total  number  of  individuals  per  unit  area 

FI  0.2 

A A 

•Absolute  abundance 

FI  0.2 

XN 

Number  of  individuals  for  each  station 

FT  0.0 

XP 

Relative  abundan''e 

F10.2 

xs 

Number  of  species  present  at  each  station 

FI  0.0 

H 

Shannon  diversity  index  (designated  d in  section  II. A) 

F10.4 

HM 

Relative  maximum 

FI  0.4 

llMl 

Relative  minimum 

FT  0.4 

HF 

Relative  efficiency  (normed  index) 

F10.4 

HRR 

Redundancy 

F10.4 

SL 

Simpson  lambda 

FT  0.4 

S 

Simpson  diversity  index 

FI  0.4 

SM 

Relative  maximum 

FI  0.4 

SMI 

Relative  minimum 

FI  0.4 

Variable  name 


Description 


Type 


SE 

Relative  efl'iciency  (tiormed  index) 

El  0.4 

SRR 

Redundancy 

El  0.4 

LID 

McIntosh  delta 

El  0.4 

U 

McIntosh  diversity  index 

E10.4 

UM 

Relative  maximum 

El  0.4 

UMI 

Relative  minimum 

ET0.4 

UE 

Relative  efficiency  (normed  index) 

E10.4 

URR 

Redundancy 

El  0.4 

IV.  DESCRIPTION  OF  PROGRAM  OPTIONS. 

A.  Sampling  .Methods. 

Three  frequently  used  methods  of  collecting  macroinvertebrates  are  programmed  into 
DICALC.  One  of  these  methods  may  be  selected  for  the  calculations  of  the  number  of  individuals 
per  unit  area.  If.  however,  none  of  these  three  standard  sampling  methods  is  used,  ♦ numerical 
value  of  the  surface  area  of  the  sampler  may  be  read  in  as  a variable. 

The  following  options  may  be  selected  for  the  METHOD  variable; 

1.  EK.MAN-This  option  is  for  the  use  of  any  6-  by  6-inch  grab  method.  The  total 
area  of  the  grab  is  0.02323  m^. 

2.  SURBFR-This  option  is  for  the  use  of  any  12-  by  12-inch  bottom-sampling 
method.  The  area  of  the  sampler  is  0.09290  m*. 

3.  DENDY-This  option  is  for  the  Hester-Dendy  multiplate  samplers.  This  method 
uses  eight  7.75-  by  7.75-cm  plates,  each  0.3  cm  apart,  and  seven  2.60-  by  2.60-cm  spacers.  The  sur- 
face area  is  0.08664  nr. 

4.  OTHER -This  is  for  any  collection  method  in  which  the  total  sampling  area  is 
known  and  given. 

B.  Data  To  Be  Considered. 

The  following  options  may  be  selected  for  the  PROG  variable: 

1.  TfJTALS -With  the  “TOTALS”  option,  the  program  calculates  the  number  of 
individuals  per  unit  area,  the  relative  abundance,  the  absolute  abundance,  and  the  different  diversity 
measures  for  the  station  totals. 

2.  SUBTOT-With  the  “SUBTOT”  option,  the  program  calculates  the  number  of 
individuals  per  unit  area,  the  relative  abundance,  the  absolute  abundance,  the  different  diversity 
measures  for  each  replicate  in  the  sampling  station,  and  the  overall  station  total. 
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m 
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3.  CSRl— The  “CSKl”  option  calls  the  subroutine  CSR,  which  calculates  the 
cumulative  sums  of  the  replicates  for  each  station.  The  program  then  uses  these  cumulative  sums  to 
calculate  the  cumulative  number  of  individuals  per  unit  area,  the  relative  abundance,  and  the 
different  diversity  measures.  With  the  addition  of  each  replicate,  the  change  in  diversity  and  the 
number  of  species  present  can  be  seen.  This  option  is  also  useful  in  determining  the  number  ol 
replicates  that  adeciuately  sample  the  biological  community. 

V DESCRIPTION  OF  DATA  CARDS. 

A.  Title  Card. 

Tlie  title  card  shown  in  figure  1 includes  location,  sample  station,  date,  and  method  of 
sampling  in  format  12.A6. 

B . Program  Option  Cards. 

If  using  I'KM.AN,  DENDY,  or  SURBF  R method  of  sampling,  tlie  card  shown  in  figure  2 
is  used.  If,  however,  the  numerical  value  for  the  surface  area  is  to  be  read  in,  then  the  card  sliown  in 
figure  3 is  used. 

C.  Number  of  Substations  in  Each  Station. 

The  number  of  replicates  at  each  station  must  be  supplied  in  the  card  format  sliown 

m figure  4. 

There  must  be  the  same  number  of  entries  as  there  are  stations  (ISS(J)  where  J = 1 

NSS).  Use  as  many  cards  as  necessary  to  include  all  substations. 

D.  Station  or  Substation  Name  Cards. 

The  number  of  these  cards  must  eiiiial  the  number  of  stations  if  PROd  equals  TOTALS, 
must  equal  the  number  of  substations  if  PROG  equals  CSRl,  and  must  equal  the  number  of  sub- 
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18  20 

26  28 

“ 

Figure  3.  Alternative  Sampling  Method  Option  Card 
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No.  of  sub- 
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P__ 

No.  of  sub- 

stations  in 

stations  in 

stations  in 

station  2 

station  3 

last  station 

ISSI2) 

ISS(3) 

ISS(NSS) 

Figure  4.  Substation  Number  Card 
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30 


stations  plus  the  number  of  stations  if  PROG  equals  SUBTOT.  The  appearance  of  station/substation 
name  eards  is  shown  in  figure  5. 

E.  Species  Names  and  Number  Cards. 

For  each  species,  a card  appearing  as  in  figure  6 provides  the  species  name  and  a 2-digit 
number  as  identifiers.  The  number  of  these  cards  must  equal  the  number  of  different  species  present 
(NS). 

F.  Format  Cards. 

The  format  by  which  data  are  to  be  read  in  is  determined  from  the  format  card  ( figure  7 ). 
Two  format  cards  must  be  included  in  the  card  set  even  if  the  format  instructions  only  use  one. 
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Figure  7.  Format  Card 


G.  Data  Cards. 


Data  are  read  into  a data  matrix  according  to  the  format  cards  described  in  the  preceding 
section.  There  must  be  at  least  as  many  data  cards  as  there  are  species  name  cards;  i.e.,  there  must 
be  at  least  one  individual  present  for  each  species  named. 

VI  ARRANGEMENT  OF  DICALC  PROGRAM. 


The  order  in  which  program  card  sets  appear  in  the  card  deck  is  as  follows: 


Card  set 


Description 


I 

-> 

3 

4 

5 

6 

7 

8 

9 

10 
I I 

Final 


Job  control  cards 
DICALC-main  program  deck 
Control  card(s) 

CSR  subroutine 

Title  card 

Option  card 

Substation  card 

Station/substation  name  card 

Species  name 

Format  cards 

Data  cards 

End  of  file  control  card 


VII.  SAMPLE  OUTPUT. 


A sample  of  the  output  from  DICALC  is  shown  in  figure  8. 
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Figure  8.  Sample  Output  From  DICAI  C Program 
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Figure  8 (Concluded).  Sample  Output  From  DIC'ALC  Program 


vm.  CONCLUSION. 


Macroinvertebrate  communities  are  commonly  analyzed  by  calculating  species  diversity 
and  relative  abundance.  With  the  modifications  presented  here,  DICALC,  using  these  widely  accepted 
methods,  permits  the  user  to  swiftly  calculate  species  diversity  and  relative  abundance.  In  addition, 
data  can  be  expressed  in  standard  density  units  (number  of  individuals  per  scpiare  meter).  The  addi- 
tional calculations  ol  relative  efficiency,  redundancy,  and  relative  maximum  and  minimum  help  the 
user  to  identity  peculiarities  about  the  distribution  of  individuals  in  each  sample.  Absolute  abun- 
dance shows  the  proportion  of  individuals  of  a particular  species  collected  in  one  of  the  samples 
during  a study.  This  information  permits  the  user  to  evaluate  the  numerical  significance  of  any 
species  in  any  sample.  DICALC  is  a versatile  program  that  is  very  helpful  in  investigating  differences 
in  community  structure  between  sampling  sites.  When  these  data  are  correlated  with  environmental 
factors  such  as  water  quality,  cause-and-effect  relationships  can  be  determined. 
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Washington.  DC  20372 

Commander 

Naval  Air  Systems  Command 

1 Attn:  Code  AIR-3S0L 

Attn:  Code  AIR-53231  B 

Washington,  DC  20361 

DLPARTMI.NT  Ol  Till.  AIR  I ORCt 

1 

2 HQ  I oreign  Technology  Division  (Al’SC) 

Attn:  PDRR 

Wright-Patterson  ALB,  OH  45433 

HQ.  USAF/SGPR 
I'onestal  Bldg 
WASH  DC  20314 

I Director 

Air  Force  Inspection  and  Safety  Center 
Attn:  IGDIAl  ISC/StV) 

Norton  AFB,  CA  92409 

1 

Commander 

Armament  Development  & Test  Center 
Attn:  DLOSL  (Technical  Library) 
KgUnAFB.FL  32542 
I 

OUTSIDK  AGFNCIFS 

Director  of  Toxicology 
National  Research  Council 
I 2101  Constitution  Ave.  NW 

Washington,  DC  20418 

US  Public  Health  Service 
Room  I 7.A46  (CPT  Osheroff) 

I 5600  Fishers  Lane 

1 Rockville,  MI)  20857 

2 

ADDITIONAL  ADDRFSSFFS 
Commander 

USDRC  Installations  & Services  Agency 
I Attn:  DRCIS-Rl-lU 

Rock  Island.  IL  61201 


DISTRIBUTION  LIST  I OR  ARCSl.  I R-77035  ((\>nid) 


(ommander 

US  Army  Maicriol  IXrvclopment  & 

Readiness  ( ominand 

Munition  Production  Base,  Modernization  & Expansion 
Attn  I)RCPM-PBM-KC/Project  Manager 
Alin:  DRCPM-PBM-EB/Mr.  A.  Selman 
Dover.  NJ  07801 

Commander 

DA-DC'SLOC; 

Attn;  DALO-AMP 
Wasliington,  DC  20310 

Department  ol  the  Army 
Attn:  DAI-:N-ZCK 
Washington.  IK'  203\0 

Commander 

l!S  Army  Construction  Engineering 
Research  Laboratory 
Attn;  CFRL-FR 
Champaign.  !L  61820 

Otfice  ot  Oiiel  o!  Engineers 
Attn:  DA1:N-MCZ-F 
Washington.  IX’  20314 

District  Engineer 
US  Army  Corps  of  Engineers 
6014  US  PO  & Courthouse 
215  N I 7th  Street 
Omaha.  NB  68102 

District  Engineer 
US  Army  Corps  of  Engineers 
Attn:  HNDED-M 
Attn:  MNDEIMC 
PO  Box  1600  West  Station 
Huntsville,  AL  35807 

Commander 

Holston  Army  Ammunition  Plant 
Attn:  SARHO-E 
Kingsport.  TN  47662 

Commander 

Milan  Army  Ammunition  Plant 
Attn:  SARMI-S 
Milan.  TN  38358 

Commander 

Newport  Army  Ammunition  Plant 
Attn:  SARNE^S 
Newport.  IN  47966 

Commander 

Radford  Army  Ammunition  Plant 
Attn:  SARRA-IF 
Radford.  V A 24141 

Commander 

Sunflower  Army  Ammunition  Plant 
Attn:  SARSU-O 
I-awrcnce.  KS  66044 


Copies  Names 

Commander 

Volunteer  Army  Ammunition  Plant 
Alin  SARVO-I 
Cllallanooga,  IN  37401 

1 Environmental  ProieUion  Agency 

I Attn  Mr  Harry  Ike 

26  l ederal  Pla7j/Rm  837 
New  York.  NY  KHHI7 

Environmental  Protection  Agency 
I Attn:  Mr  Ben  Lacey 

C'urtis  Building,  2d  floor 
6th  and  Walnut  Sts 
Philadelphia.  PA  19106 

* Environmental  Protection  Agency 

Attn;  Mr.  Arthur  Linton 
1421  Peachtree  Street.  NE 
Atlanta,  GA  30309 

I nviromnemal  Protection  Agency 
I Attn.  Mr  Don  A.  Wallgren 

1 North  Wacker  Drive 
Oiicago,  IL  60606 

Environmental  Protection  Agency 
^ Attn:  Mr.  James  Highland 

160n  Pattersrm  Street 
Dallas,  TX  752iH 

1 

Environmental  Protection  Agency 
Attn:  Mr.  Robert  Koke  (RM  2491 
1735  Baltimore  Street 
Kansas  City,  MO  64108 

Environmental  Protection  Agency 
Attn;  Mr  Keith  Synder 
j Lincoln  Tower  Suite  900 

I I860  Lincoln  Street 

Denver.  (’O  802f>3 

Environmental  Protection  Agency 
Attn:  Mr  Irving  lerzuh 
100  California  Street 
San  I rancisco.  ( A 941 1 I 

1 

Army  Logistics  Management  Center 
Environmental  Management 
Attn;  LCDR/J.C.  Bolander 
Fort  Lee,  VA  23801 

* US  Army  Fngineer  Waterways 

Experimental  Station 
EwviTOTVTTvental  Systems  Division 
PO  Box  631 

^ Vicksburg.  Ml  39180 

Director 

Center  for  Environmental  Studies 
Virginia  Polytechnic  Institute  & 

State  University 
1 Blacksburg.  VA  24061 


